Hierarchy-based semantic embeddings for single-valued & multi-valued categorical variables

نویسندگان

چکیده

Abstract In low-resource domains, it is challenging to achieve good performance using existing machine learning methods due a lack of training data and mixed types (numeric categorical). particular, categorical variables with high cardinality pose challenge tasks such as classification regression because requires sufficiently many points for the possible values each variable. Since interpolation not possible, nothing can be learned seen in set. This paper presents method that uses prior knowledge application domain support cases insufficient data. We propose address this by embeddings are based on an explicit representation (KR), namely hierarchy concepts. Our approach 1. define semantic similarity measure between categories, hierarchy—we purely hierarchy-based measure, but other measures from literature used—and 2. use modified one-hot encoding. two embedding schemes single-valued multi-valued perform experiments three different cases. first compare approaches our word pair case. followed creating approaches. A comparison Google, Word2Vec GloVe several benchmarks shows better concept categorisation when knowledge-based embeddings. The third case medical dataset semantic-based standard binary encodings. Significant improvement downstream achieved information.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Preference Function Learning over Numeric and Multi-valued Categorical Attributes

One of the most challenging goals of recommender systems is to infer the preferences of users through the observation of their actions. Those preferences are essential to obtain a satisfactory accuracy in the recommendations. Preference learning is especially difficult when attributes of different kinds (numeric or linguistic) intervene in the problem, and even more when they take multiple poss...

متن کامل

Automatic preference learning on numeric and multi-valued categorical attributes

One of the most challenging tasks in the development of recommender systems is the design of techniques that can infer the preferences of users through the observation of their actions. Those preferences are essential to obtain a satisfactory accuracy in the recommendations. Preference learning is especially difficult when attributes of different kinds (numeric or linguistic) intervene in the p...

متن کامل

Common fixed point theorem for nonexpansive type single valued mappings

The aim of this paper is to prove a common fixed point theorem for nonexpansive type single valued mappings which include both continuous and discontinuous mappings by relaxing the condition of continuity by weak reciprocally continuous mapping. Our result is generalize and extends the corresponding result of Jhade et al. [P.K. Jhade, A.S. Saluja and R. Kushwah, Coincidence and fixed points of ...

متن کامل

Nonlinear Viscosity Algorithm with Perturbation for Nonexpansive Multi-Valued Mappings

In this paper, based on viscosity technique with perturbation, we introduce a new non-linear viscosity algorithm for finding a element of the set of fixed points of nonexpansivemulti-valued mappings in a Hilbert space. We derive a strong convergence theorem for thisnew algorithm under appropriate assumptions. Moreover, in support of our results, somenumerical examples (u...

متن کامل

Fixed Point Theorems for Compatible Multi-valued and Single-valued Mappings

The notion of compatibility for point-to-point mappings recently defined by Jungck is generalized to include multi-valued mappings. This idea is used to establish a fixed point theorem for a generalized contractive multi-valued mapping and a single-valued napping.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Intelligent Information Systems

سال: 2021

ISSN: ['1573-7675', '0925-9902']

DOI: https://doi.org/10.1007/s10844-021-00693-2